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Abstract 

We study solutions to the multi-marginal Monge-Kantorovich problem which are concentrated on 
several graphs over the first marginal. We first present two general conditions on the cost function 
which ensure, respectively, that any solution must concentrate on either finitely many or countably many 
graphs. We show that local differential conditions on the cost, known to imply local d-rectifiability of the 
solution, are sufficient to imply a local version of the first of our conditions. We exhibit two examples 
of cost functions satisfying our conditions, including the Coulomb cost from density functional theory in 
one dimension. We also prove a number of results relating to the uniqueness and extremality of optimal 
measures. These include a sufficient condition on a collection of graphs for any competitor in the Monge- 
Kantorovich problem concentrated on them to be extremal, and a general negative result, which shows 
that when the problem is symmetric with respect to permutations of the variables, uniqueness cannot 
occur except under very special circumstances. 


1 Introduction 

In this paper, we consider the multi-marginal Monge-Kantorovich transport problem. Given Borel probability 
measures /zi,..., /z n (the marginals) on smooth manifolds Xi ,..., X n , respectively, let II(/zi,..., /z n ) be the set 
of Borel probability measures on X\ x X 2 x ... x X n which have Vi-marginal /z,; for each i G {1, 2, ..., n}. 
Letting 

c:IixI 2 x...xI„-> R, 

be a measurable cost function, the transport cost associated to a transport plan 7 r G II(/zi, ...,/z n ) is defined 

by 

Ic( 7r) = / c(xi, ...,Xn) d-K. 

J X 1 xX 2 X...xX n 

The multi-marginal Monge-Kantorovich transport problem is then the following minimization problem: 

inf |/ c ( 7r ); 7 T G n(/z 1 ,...,/z n )|. (. MK) 

Under quite mild conditions on c and the /q, it is well known that ( MK ) admits a minimizcr. Our 
purpose here is to study the geometric structure of optimal measures 7 r G II(/zi,..., /z„). 

Note that when n = 2, (MK) is precisely the classical optimal transport problem of Monge and Kan¬ 
torovich. In this case, assuming that the first marginal /zi is absolutely continuous with respect to local 
coordinates, the celebrated twist condition implies that the solution is concentrated on a graph over X\ , and 
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is unique mmi]. While this condition holds for many important cost functions, there are also 
a variety where it fails, including any smooth cost on a compact manifold. Recently, there have been some 
developments on two marginal problems where the twist condition fails mm- In particular, one of the 
present authors developed two conditions, known as the m—twist and generalized twist conditions, which 
ensure respectively that the solution concentrates on several or countably many (rather than one) graphs 
[ 231 . A related line of recent research concerns sufficient conditions for uniqueness of the optimal measure, 
one interesting goal being to exhibit smooth costs on arbitrary manifolds for which optimal plans are unique, 
despite the fact that such plans are not generally concentrated on graphs. The interested reader is referred 
to a recent manuscript [22] and also to uniGBnii for more examples. We recall that a measure 7 in the 
convex and weakly compact set /12) is called extremal if 7 = tji + (1 — 1)^2 with 71,72 £ n(/ii, fi 2) and 

0 < t < 1 imply that 71 =72- It is well known that if (MK ) admits a unique solution, then this solution 
must be extremal in ^2)', an important aspect of the research on uniqueness is the characterization of 

extremal points of II(/xi,/Z2). It is known that the extremal points of II(/.4i,/X2) are not necessarily concen¬ 
trated on a single graph. Indeed, any extremal point of II(/zi, ^2) is known to concentrate on the union of the 
graph of a function from Xi to X2 and the antigraph of a function from X2 to X\. We refer the interested 
reader to [ 25 | where a sufficient and almost necessary condition for the structure of the support of extremal 
measures in n(//i, ^2) is established. 

Interest in the n > 3 case (also known as the multi-marginal case) has picked up recently. Here, an 
analogue of the twist condition, known as the twist on splitting sets condition has recently been developed; 
this condition ensures that the solution to the multi-marginal problem concentrates on a graph over the 
first variable x\ (and is unique) [ 191 . In contrast to the two marginal setting, where the twist condition 
is reasonably mild (at least for spaces with trivial topology), the twist on splitting sets condition is very 
strong and there are many important examples for which it fails. In particular, examples arising in density 
functional theory in physics sum and roommate type problems in economics [8] involve equal marginals and 
cost functions which are symmetric under all permutations of the arguments (see also m- For problems 
with this symmetric structure, it is known that there are always solutions that do not concentrate on 
a single graph (except in the very simple special case when the diagonal {(x,x, ...x)} supports a unique 
optimizer) | 29 j . In this paper, we focus on what can be said in the multi-marginal case when the solution 
does not concentrate on a single graph; in particular, we investigate the possibility that the solution may be 
concentrated on several graphs. 

Specifically, we consider the to— twist condition on splitting sets, and the related generalized twist on 
splitting sets condition, (see Definition ^. 2 D which are analogues of the conditions in [ 2 ) 3 ] in the two marginal 
case. These conditions were recently introduced by one of the authors in [ 24 ] . and assuming the first marginal 
is absolutely continuous with respect to local coordinates, they imply, respectively, that any optimal plan it 
concentrates on at most m graphs, or countably many graphs, over the first variable. 

The to— twist condition on splitting sets is somewhat more flexible than the twist on splitting sets condi¬ 
tion and, in particular, it seems relevant to symmetric costs which arise in the applications outlined above. 
We show, for instance, that the Coulomb cost (see ([ 5 ]) below) from density functional theory is m-twisted 
on splitting sets in one dimension. Our work therefore offers a different perspective on a recent result of 
Colombo, De Pascale and Di Marino m, who constructed explicit solutions to ( MK ) for this cost, when the 
marginals are all equal; their solutions concentrated on (n — 1)! graphs (where n is the number of marginals). 
In fact, our result also applies to the case when the marginals are not equal; on the other hand, we are able 
to prove only that solutions concentrate on at most n\ — (n — 1)! graphs. It is unclear to us whether there are 
in fact marginals for which our upper bound on the number of graphs is sharp; if so, the marginals cannot 
all be equal. 

We also demonstrate that a local differential condition on c, which is known to imply that the optimizer 
concentrates on a low dimensional subset of the product space, also implies a local version of the twist 
condition on splitting sets. This local twist on splitting sets in turn implies the generalized twist on splitting 
sets, so that each optimal plan concentrates on the union of the graphs of at most a countable number of 
maps. 

Following the recent developments on uniqueness issues for non graphical solutions in the two marginal 
case, we establish here some results which can be useful to derive uniqueness of solutions concentrated on 
several graphs in the multi-marginal setting, and exhibit some examples. We also prove a general, nega¬ 
tive result, implying that the permutation symmetric problems described above can have unique solutions 
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only in very special circumstances. For solutions concentrated on several graphs, however, these circum¬ 
stances are somewhat more flexible than in the single graph case (where only the trivial, diagonal solution 
(Id, Id, ...Id)#n has the possibility of being a unique optimizer); we provide an example of a symmetric three 
marginal problem where a unique solution concentrates on two graphs and at least one of the components 
of each is not the identity mapping. 

The next section is devoted to preliminary definitions. In the third section, we state our results for 
costs which satisfy the m and generalized twist on splitting sets conditions. We then prove that the local 
differential conditions alluded to above imply local twist on splitting sets, and outline several examples. In 
the fourth section, we prove several results related to uniqueness of solutions concentrated on multiple graphs. 
We show that under the generalized twist on splitting sets condition, a collection of several graphs containing 
the support of every optimizer is uniquely determined. We then establish conditions under which any solution 
concentrated on several graphs must be extremal, and prove that solutions to symmetric problems cannot 
be unique when n > 3 , except under very particular conditions. 

2 Definitions and preliminaries 

Here we recall several definitions which will be needed throughout the paper. We begin with the definition 
of c-splitting sets, modified slightly from Pi- 

Definition 2.1 A set S C X\ x X 2 x ... x X n is a c-splitting set (or simply a splitting set) if there exist 
Borel functions Ui : Xi —»• [— 00 , 00 ) such that for all (aq, aq, ■ ■■, x n ), 

n 

y < c(x 1,X 2 , —,x n ) (1) 

i=1 

with equality whenever (aq,aq,... ,x n ) £ S. The n-tuple (u\,...,u n ) is called the c-splitting tuple for S. 

The relevance of splitting sets to ( MK) is that, as can be easily seen from the dual formulation found in 
m , any optimal measure is concentrated on a c-splitting set. 

Next, we recall the following relaxations, introduced in [24], of the twist on splitting sets conditions from 
m- In what follows, D Xl c(x 1 , 2 : 2 , ...,i n ) denotes the differential of c with respect to the first variable aq. 

Definition 2.2 Let c be a Borel measurable function and denote by D\(c) the set of points at which c is 
differentiable with respect to the first variable. 

1. m-twist condition: We say that c is m-twisted on splitting sets if for any c-splitting set S C X\ x X 2 x 
... x X n , any aq £ X\ and any p in the cotangent space Tf^X\ to X\ at aq, the cardinality of the set 

|(aq, aq 5 —,x n ) £ S n -Ch(c); D Xl c(xi,x 2 , ...,x n ) = pj, 

is at most m. We say that c locally satisfies the m-twist condition on splitting sets if for any (aq, aq,..., x n ) £ 
S (~l -Di(c) there exists a neighborhood U C X 2 x ... x X n of (aq,... ,x n ) such that the cardinality of the set 

|y £ U, (aq ,y) £ S n Z?i(c), D Xl c(x 1 ,y) = D Xl c( aq,aq, ...,x n )|, 

is at most m. 

2. Generalized-twist condition: We say that c satisfies the generalized twist condition on splitting sets 
if for any c-splitting set S C X\ x X 2 x ... x X n , any aq £ X\ and any p in the cotangent space Tf x X 1 to 
X\ at aq, the set 

j(xi,aq,... ,x n ) e5n -Di(c); D Xl c( aq,aq, ...,x n ) = pj, 

is a finite subset of S. 

Remark 2.3 Given a c-splitting set S and splitting functions u±, ....u n , it is well known that one can con¬ 
struct new splitting functions, u\, ....u n for S such that, for each i = 1, 2...., n, we have 

Ui(xi) = inf c(aq, ...x n ) - Y' Uj(xj). (2) 

aq .a#* -rr' 
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Assuming c is globally Lipschitz, it is well known that each Ui is as well (unless Ui is identically —oo, 
in which case the splitting set is empty). Therefore, the set {c{x\ ,... x n ) = 1 fh(£i)}> which contains S, 

is closed (and hence compact if each Xi is). 

The following result provides a connection between the generalized twist condition on splitting sets and 
local m-twistedness on splitting sets. 

Proposition 2.4 Assume that c is continuously differentiable with respect to the first variable and each Xi 
is compact. If, for some m £ N, the function c is locally m-twisted on splitting sets, then c satisfies the 
generalized-twist condition on splitting sets. 

Proof. Assume that S C X\ x ... x X n is a c-splitting set. 

Fix (aii, ...,x„) £ S. We need to show that the set 

L = {(xi,X 2 , ..., X n ) £ S] Dic(xi,x 2 , ...,X n ) = D 1 c(x 1 ,x 2 , ...Zn) j, 

is finite. By Remark 12.31 we can find a compact splitting set, S, such that S C S; we will in fact prove the 
stronger statement, that 

L = {(xi,x 2 , ...,x n ) £ S; Dxc(x-l,X 2 , -,x n ) = D 1 c(x 1 ,x 2 , —x „)}, 

is finite (note that L C L ). If L is not finite there exists a countably infinite subset {(xi,x%, ...x„)}fc £ N C L. 
Since S is compact then the sequence {(xi,X2, ...x„)}fc £ N has an accumulation point (x\,x\, ... x °) £ S and 
there exists a subsequence still denoted by {(xi,x 2 , ...a;^)}fc e N such that x\ —> x° as k —> oo for i = 2, 

Since Die is continuous it follows that (xi,#®, ■•■ a; n) £ Since c is locally m-twisted on S, this leads to a 
contradiction as (xijX 1 ), ... x °) is an accumulation point of the sequence {(xi,X2, ...x^)}fegN and 

Dic(xi, x°, ...x°) = Dic[xi,X 2 , ...a;*), Vfc £ N. 


This completes the proof. □ 

We recall next an important property of splitting sets, which will be useful at various points throughout 
the paper. 

Definition 2.5 A set S C X\ x X2 x ... x X n is c-monotone if the following holds. Whenever x,x £ S, and 
P+, P_ are two non empty disjoint sets of indices such that P+ U P_ = { 1 , 2 ,..., n}, we have 

c(x) + c(x ) < c(x+, X-) + c(x+, x_). 

Here, we have decomposed x = (x+,x_) and x = (x+,x_) in the obvious way; that is, x+ = (xi)i^p + , with 
analogous definitions for x_, x+ and x_. 

It is well known that any c-splitting set is c-monotone (see mm))- 

Finally, we define precisely some notation describing measures concentrated on several graphs. 

Definition 2.6 Let X and Y be Polish spaces with Borel probability measures p on X and v on Y. We say 
that a measure 7 £ II(/Lt, v) is concentrated on the graphs of measurable maps {T,}* =1 from X to Y, if there 
exists a sequence of measurable non-negative functions { }i—i from X to R with X+=i cr,(x) = 1 (p-almost 
surely) such that for each measurable set S C X x Y, 

k r. 

t{S) = oti(x) xs{x,Tix)dp, 

i= 1 Jx 

where xs is the indicator function of the set S. In this case we write 7 = Y,i=i&i(Id x Ti) # p. 
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3 Multi-graph solutions 

The following characterization and its proof can be found in [24] : it asserts that, under the m —twist on 
splitting sets condition, any solution to the Kantorovich problem concentrates on (at most) m graphs over 
the first variable. 

Theorem 3.1 A ssume that the cost function c satisfies the m— twist condition on splitting sets, is non- 
atomic and any function Ui(xf) that is of form (J 2 ) and not identically infinite is differentiable ni-almost 
surely on its domain. Then for each optimal plan 7 of ( MK ) with Supply) C Z?i(c), there exist k < m, a 
sequence of non-negative measurable real functions {aj}k =1 on X\ and, Borel measurable maps G\,...,Gk '■ 
X\ —> X 2 x ... x X n such that 

k 

7 = ^2ai{Idx Gi) # n, (3) 

i=1 

where ]T)i=i on{x) = 1 for pi-a.e. x £ X±. 

Moreover, if one replaces the m-twist condition by the generalized-twist condition then each optimal plan 7 
of ( MK) is of the form 0 ) for some k £ N U { 00 }. 

Although this result asserts quite strong conclusions about the structure of optimizers, the m— twist on 
splitting sets (respectively, generalized twist on splitting sets) hypothesis seems quite difficult to verify. 
Below, we establish that certain local differential conditions on the cost actually suffice for the generalized 
twist on splitting sets condition. 


3.1 A differential condition for local twist on splitting sets 

Here we establish a differential condition on c which is sufficient for the local 1—twist on splitting sets 
condition. The condition first appeared in [28] , where is was shown to imply that the solutions concentrate 
on low dimensional subsets of the product space. Assume that each Xi is a d-dimensional smooth manifold. 
As in [25] , we let g be the off diagonal part of the Hessian of c; that is, in block form, 


0 D l 1X2 C 

d 1 2X1 c 0 D 2 X2X3 c 


where each D x . Xj c = { d d a c a )a/ 3 , for i ^ j, is the d x d matrix of mixed, second order partial derivatives 

We recall that the signature of a symmetric matrix is the ordered triplet, (A + ,A_,Ao), where A + , A_ and 
Ao denote respectively the number of positive and negative eigenvalues, and the multiplicity of the zero 
eigenvalue. 

We then have the following. 


D xi x n c 

^X 2 X n C 


(4) 


Proposition 3.2 Let c be twice differentiable. The following assertions hold: 

1. If the symmetric, mn x mn matrix g has signature (nd — d, d, 0) then c is locally 1—twisted on splitting 
sets. 


2. If in addition each Xi is compact then c satisfies the generalized twist condition on splitting sets. 

Proof. We first prove assertion 1. The proof is by contradiction; to this end, assume that c is not locally 
1—twisted on splitting sets. It follows that there exists a splitting set S and a point (xi,x 2 , ■■■,x n ) £ S such 
that the set 


|(z 2 , -,x n ) : (xi,x 2 , ..;X n ) £ S, D Xl c(X!,x 2 , —,x n ) = D Xl c{xi,x 2 , —,x n )j 

intersects any open neighbourhood of (x 2 , ■■■,x n ) (at a point other than ( x 2 , ...,x n )). We can therefore take 
a sequence (x$, ... ,x ^) in this set converging to (x 2 ,...,x n ), such that (a;|, ...,a;£) 7 ^ (x 2 , ...,s„) for all k. 
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Set 


k {xl,...,x*)-(x 2 ,...,x n ) 

V — -. 

\(x^,...,x k ) - {x 2 ,...,x n )[ 

As each v k has unit norm, we may pass to a convergent subsequence, so that v k —» v = [v 2 ,v n ) ^ 0. Note 
that the vector (0,v) = (0, v 2 , • v n ) is then tangent to S at (xi,x 2 ,...,x n ). 

Furthermore, the splitting set S is necessarily c— monotone, 
and so, by a result in [28] we have 


(0, v) T -g-(0 ,v) <0 

This is equivalent to 


E v i- D l iXj c-Vj <0. 


Now, since for each k we have 


D Xl c(x i,x 2 , -,x n ) - D Xl c(x$,x%, -;X k ) 
\(x k ,...,x k ) - (x 2 ,...,x n )\ 


= 0 , 


^n) 

taking the limit as k —^ oo yields 

n 

0 = ^2 D l lXj c-v j . 

j=2 

But now let w G T Sl X i, set, and consider the vector u = (w, 0,0,..., 0) + s(0, v). We then have 


k n n 

u t -g-u= ^2 D l iXj c-Uj =2sw -^2Dl iXj c- -vj + ^ u* • D 2 XiXj c ■ Vj = sV • g ■ v < 0. 

We have therefore found a d+1 dimensional vector space, T $1 Ai ®span(u) C T SlXS2> ... >Sn Xi xl 2 x 
on which g is negative definite, contradicting the assumption that g has only d timelike directions, and com¬ 
pleting the proof of assertion 1. Proposition 12.41 then immediately implies the second assertion. □ 

The last proposition together with Theorem 13.11 imply that each optimal plan is concentrated on the 
union of the graphs of at most a countable number of maps. 


3.2 Application: the one dimensional Coulomb cost with n marginals. 

Let c : K" —> R be the one dimensional Coulomb cost; that is, take each Xi = 1R. and set 


c(x i, ...,x n ) 


E 



V(xi, ...,x n ) G R". 


(5) 


This cost has important applications in density functional theory in physics cm a recent paper by 
Colombo, De Pascale and Di Marino [TO] solves the optimal transport problem with this cost explicitly when 
the marginals /i, are all the same, which is the most important case in view of physical applications. The 
solutions they exhibit all concentrate on several graphs. Here, we show that this cost satisfies the m twist 
on splitting sets condition, which, by virtue of Theorem 13.II uncovers a new perspective on their result, as 
well as showing that the multi-graph solution structure persists to the setting where the marginals differ. 


Proposition 3.3 The one dimensional Colulomb cost © is n\ — (n — 1)! twisted on splitting sets. 
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Proof. Let S be a splitting set and u\, ...,u n the corresponding splitting functions. Fix x\ 6 K and 
As the splitting functions never take on the value +oo, the inequality 0 implies that there is no point 
xi, ...x n £ S with Xi = Xj for some i ^ j. We first prove that, for each permutation a on n letters, there is 
at most one (X 2 , x n ) £ X 2 x ... x X n , such that D xi c(x 1 , ...x n ) = p and 

(xi,X 2 , —x m ) GSfl A a 

where 

A a = {(xi,x 2 , ...Xn) : < x„y), Vi < j}. 

Now, suppose we have two such points (xi,X 2 ,... x n ) and (xi,X 2 , ...,x„) in S f~l A a with 

D Xl c(x 1,x 2 , —X n ) = p= D Xl c(x 1,x 2 ,..., Xn). 

We claim that either Xj > x* for all * = 2, ...n or vice versa. As it is easy to check that Xj >-» D Xl c(x 1 , X 2 ,..., x n ) 
is strictly monotone decreasing on A,j for each i, this will establish the result (as if, for example, Xj > x l for 
all i = 2, ...n, then the monotonicity implies D xi c(x i,X 2 , ...x n ) < D Xl c(xi, X 2 , ...,x„), with strict inequality 
as long as Xj > Xj for at least one *). 

To see the claim, we use the fact that the splitting set 5 is c-monotone. Assume that the claim is false; 
then there exists non empty disjoint sets P+, P_ C {1, 2,..., n} such that P+ U P_ = {1, 2,..., n}, and x* > x, 
for i in P+, x* < Xj for i in P_, and at least one of the inequalities is strict in each set. 

For ease of notation, we decompose (xi, X 2 , ...x n ) = (x+,x_) and (xi,x’ 2 ,..., x n ) = (x+,x_), (with the 
same meaning for x+,x_,x+ and x_ as in Definition 12.51 
By the c-montonicity property, we have 


c(x+,x_) + c(x+,x_) < c(x+,x_) + c(x+,x_), 


which is equivalent to 


E 

iGP+JGP- 




< E 

i&P+J&P- 




( 6 ) 


Now, for a fixed * G P+, j £ P_, consider the paths Xj(f) = Xj + f(xj — Xj) and Xj(s) = Xj + s(xj — Xj). 
As both x and x are in the same A a we can assume without loss of generality that x, < Xj and Xj < Xj. 
Together with the fact that (5j — Xj)(xj — Xj) < 0, (which follows from i being in P+ and j in P_) this 
implies that for each point (t, s ) G [0, l] 2 we have Xi(t) < Xj (s). We therefore have 


d 2 

1 

d 2 

1 

dsdt 

l\Xi(t) - Xj{s)\\ 

dsdt 

-Xj (s) - Xj (t). 


Now note that this implies 


= —2 


Xj(s) -Xi(t)] 3 


(xi - Xi)(Xj - Xj) 


0. 


1 1 

1 1 


r 1 r 1 d 2 

1 

Xi-Xj I ' I Xj - Xj 

|xj—Xjj |xj—Xj| 


J 0 Jo dsdt 

L|Xj(f) — Xj (s) | J 


dsdt 


0 . 


Furthermore, the inequality is strict for at least one i G P+ and one j G P_; summing over i G P+ and 
j G P_, this violates ©; this contradiction establishes the claim. 

So far, we have proven that we can have at most one point (x 2 , ■■■X n ) such that (xi,X 2 , ...x n ) G S fl A a 
and D Xl c(x 1 , ...x n ) = p for each permutation a. 

Now, note that if p > 0, we cannot have any solutions to p = D Xl c(x i,X 2 , ...x n ) in the region where 
Xi < Xj for all * = 2,3, ...n (as clearly D Xl c(xi, X 2 , ...x n ) < 0 there). A similar argument applies to the 
region X\ < Xj for all i = 2,3, ...n when p < 0. Therefore, we do not have any solutions in A a when er(l) = 1 
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in the first case, or when cr(l) = n in the second case. In either case, there are (n — 1)! permutations with 
the appropriate property. 

This means that there are at most n! — (n— 1)! permutations a for which we potentially have one solution 
(x 2 , ■ ■■X n ) to D Xl c(x i,... x n ) =pon SH A a . We therefore have at most n! — (n — 1)! solutions to the equation 
p = D Xl c{x\,x 2 ,... x n ) on the splitting set S . 

□ 


Corollary 3.4 Assume that c is finite pi ® ... ( 8 > p n almost everywhere and there exists a finite transport 
plan 7 To for ( C ). If p i is non-atomic and absolutely continuous with respect to the 1 —dimensional Lebesgue 
measure then there exist k <n\ — (n — 1 )!, a sequence of non-negative measurable real functions {cq}i=i on 
R and, Borel measurable maps G i,..., Gk '■ R —> R ra_1 such that 


k 

n 0 = ^2 a i( Id x (7) 

i=1 


where &%{?) — 1 f or Pi~a.e. x G R. 

Proof. By a similar argument as in the proof of Theorem (4) in [4], there exist potentials u\,...,u n with 
Ui G C l (p,i), taking values in [—oo, oo) and c(xi,..., x n ) > u ( x i) such that 

/ c(xi, ...x n ) dn 0 = ^2 / u{xi)dpi, 

J i” i=1 Jr 

and u\ is differentiable p\ almost everywhere. It also follows from Proposition 13.31 that c is n! — (n — 1)! 
twisted in splitting sets. Thus, the result follows from Theorem 13.II □ 

As shown in m, when the marginals arc all equal (the physically relevant case) the solution is concen¬ 
trated on (n — 1 )! graphs; our result guarantees it is concentrated on at most n\ — (n — 1 )!, even when the 
marginals differ. It is unclear to us whether this is sharp; that is, whether there really are solutions which 
concentrate on n! — (n — 1)! graphs. If so, this can only happen when the marginals are not all equal. 


3.3 A cost satisfying local 1—twistedness 

We exhibit now an example of a multi-dimensional cost function which is locally (but not globally) 1—twisted 
on splitting sets, together with a solution to the optimal transport problem which concentrates on several 
(rather than one) graphs. 

Proposition 3.5 Take, for x = (x x ,x 2 ), y = (y 1 , y 2 ), z = (z 1 ^ 2 ) G R 2 , 

c(x, y, z ) = — e x +v cos(x 2 — y 2 ) — e x cos(a ; 2 — z 2 ) — e v +z cos (z 2 — y 2 ). (8) 

Then c is locally twisted on splitting sets. 

Proof. We first recall that, for three marginal costs, the condition 

D ly c [ D ly c ]~ lD lx c < 0 

is equivalent to the matrix g in (|2]) having signature (27c, 2,0) (see [2S3); hence, by Proposition 13.21 this 
condition implies the local twist on splitting sets property). We will show that this condition holds for the 
cost ©. Note that 

e xl+yl cos(x 2 — y 2 ) e xl+yl sin(x 2 — y 2 ) 

—e xl+yl sin(x 2 — y 2 ) e xl+yl cos(x 2 — y 2 ) 
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Up to multiplication by — e x +v , this is a rotation matrix through the angle x 2 — y 2 . Similarly, D 2 z c is 
— e yl+zl multiplied by a rotation through y 2 — z 2 , so its inverse transpose, [-D 2 y c] _1 , is — multiplied 

by the rotation through y 2 — z 2 (recall that, for a rotation matrix A , A T = A -1 ). Finally, D 2 zx c is — e xl+zl 
multiplied by a rotation through z 2 — x 2 . Therefore, the product D 2 y c[D 2 y c\~ 1 D 2 x c is, up to a negative 
multiplicative constant, rotation through x 2 — y 2 + y 2 — z 2 + z 2 — x 2 = 0; that is, the product is a multiple 
of the identity: 


D 2 c[D 2 c}- 1 D 2 c zx c=-e xl+ y 1 


e yl+z 


-e xl+zl I < 0 . 


Therefore, by Proposition 13.21 this cost is locally twisted on splitting sets. 


□ 


In fact, for this cost, we can exhibit explicitly an optimizer which concentrates on several graphs. To this 
end, note that 


-e xl+yl cos(x 2 - y 2 ) > -e* 1+yl > -e 2 * 1 /2 - e 2yl /2 

and we have equality only when x 1 = y 1 and x 2 — y 2 is an integer multiple of 2-7T. Applying similar reasoning 
to the other pieces of c, we get 

c(x,y,z) > -e 2xl - e 2yl - e 2zl 

with equality only when 

(x, y,z) £ S = {(a;, y, z) : x 1 = y 1 = z 1 and x 2 — y = 2kn , x 2 — z 2 = 2ln for some integers k, 1} 

This easily implies that any measure 7 concentrated on the set S is optimal in (MAT) for its marginals, as 
— e 2x , —e 2y and — e 2z serve as Kantorovich potentials. 

Locally, the set S looks like a graph, but globally, it is the union of countably many graphs (or finitely 
many, if we restrict to compact domains). 

4 Uniqueness issues 

In this section, we consider the uniqueness of solutions concentrated on several graphs. We begin with a 
criterion for extremality of measures of type © in n(/ii,..., y n ). Note that, as the Kantorovich problem is 
a minimization of a linear functional over a convex set II(/xi, ...,/x n ), it necessarily has at least one solution 
which is extremal in that set. Extremality of a solution 7 £ II(/xi, y n ) to ( MK ) is therefore a necessary 
condition for 7 to be the unique solution. 

Theorem 4.1 Let X\, A 2 , A 3 be Polish spaces equipped with Borel probability measures fii on Xi, and let 
{Gi = ( Hi , be a finite sequence of measurable maps from X\ to X 2 x X 3 . Assume that the following 

assertions hold: 

(i) For each i > 1 the map Hi is injective and onto. 

(ii) For each j > 2 the map Kj is injective and Ran(Ki) n Ran(Kj) = 0 for all i > 2 with i j. 

(Hi) There exists a bounded measurable function 9 : X 3 —>• R with the property that for each i > 2, 

9(Ki o Hf 1 (x 2 )) > 9{Ki 0 H~ 2 {x 2 )) Vx 2 £ Dom[Ki o Hf 1 ') n Uom(AT i o FT” 1 )- 

Then each 7 £ II(/xi, p. 3 ) that is concentrated on U i = 1 Graph(Gi) is an extremal point of fj. 2 i ^ 3 )- 

The proof of the preceding theorem is fairly long and so is deferred to the end of this section (see 
subsection l4.2D . We note that we have stated the result for n = 3 only to keep the presentation as simple as 
possible; it has a straight forward generalization to larger n. 

The theorem may be used in certain circumstances to deduce uniqueness of the optimizer; in particular, 
if one can show that all solutions must be concentrated on a single collection of several graphs, satisfying 
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the hypotheses in the Theorem, then uniqueness follows immediately. The following example illustrates this 
concept. 

To the best of our knowledge, this is the first example of a solution to a multi-marginal optimal transport 
problem which is both 1) unique and 2) not concentrated on a single graph. Although the example is 
admittedly somewhat artificial, we still believe it is of interest as this type of behavior has not been observed 
before in multi-marginal problems. 

Example 4.2 A unique solution concentrated on two graphs 

Let c[x, y , z) = (x — y) 2 + (x — z) 2 (x — z + 1/2 ) 2 on X\ x X 2 x A 3 = [0,1] x [0,1] x [0, 3/2]. Consider 
the maps (Hi,Kf) : X± —>• X 2 x A 3 given by (H±, K±)(x) = (x,x) and (# 2 , A 2 ) : X± —>• A 2 x A 3 given by 
(H 2 , K 2 ){x) = (x, x + |). We note that 


c(x,y,z) > 0 

with equality if and only if ( y,z ) = (. Hi,Kf)(x ), for i = 1 or 2. Therefore, letting 7 be uniform measure 
(renormalized to have total mass 1) on the union S = graph(Hi, K{) U graph(H 2 , K 2 ) of these two graphs, 
we have that 7 is optimal for it’s marginals gi,g 2 ,h 3 - To see that 7 is in fact the unique optimizer, assume 
that 7 is any other optimizer; it must necessarily concentrate on the same set S. The interpolant ^(7 + 7 ) 
then also concentrates on S and must the be optimal too. However, it is easy to see that ( Hi,Ki ) satisfy 
the conditions in Theorem |/.i| (taking, for instance, 9(z) = —z), and so §(7 + 7 ) must be extremal in 
n(pi,P 2 ,ti 3 ), a contradiction as it is the average of two distinct measures 7,7 6 II(/xi, p 2 , Ps)- Therefore, 7 
is the unique optimal measure, as desired. 


We now record a result that shows that, under the m-twisted on splitting sets criterion, uniqueness can only 
fail by reorganizing the mass within a single collection of m graphs. 

Theorem 4.3 Suppose that the continuously differentiable function c satisfies the m-twist condition on 
splitting sets, gi is non-atomic and any function u\(x\) of the form © which is not identically infinite is 
differentiable pi-almost surely on its domain. Assume that {Gi}(T 1 is a sequence of measurable functions 
from Ai to X 2 x ... x A„ such that for each i j the set {s; Gi(x) = Gj(x)} is p±-negligible. Let 7 be an 
optimal plan for ( MK ) such that 

m 

7 = ^a k (Id x (a*(®i) > 0 and a 1 (x 1 )a 2 (x 1 )...a rn (x 1 ) ^ 0 for (j ,i - a.e. x x G X ± ). (9) 

k =1 

Then for any other optimal plan 7 we have 


Supply) C Supp{ 7 ). 

Proof. By Kantorovich duality (see [121 7; for a proof in the multi-marginal case), there exist functions 
Pi £ i = l, ...,n such that 

Pi(xi) = inf |c(xi, ...,x n ) 

/ cdy = y~] / ipi{xi)dp,i. 

J t= 1 Jx > 

Let S be the c-splitting set generated by the n-tuple {p\,..., ip n ), that is, 

n 

S = j(xi, ...,X n ); c(xi,...,x n ) = ^ Pi{Xj)^. 


for all Xj £ A, and 


»=1 


It follows that 


J c(x \,..., x n ) d'y = J y^ y p i (xj)d'y, 
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from which we obtain 


y; / a i (xi)c(x 1 ,G i xi)d^ 1 = y / «i(xi) y>i(xi) + ip(GiXi) 

fc=1 fe=i 

where ij)(x 2 , ■■■,x n ) = Ti{ x i)- It then follows that 


dpi, 


y I oti{x 1) c(xi,GjXi)-ipi(xi)-^(GjXi) 

-X’l 


fc=l 


d/r 1 = 0. 


Since each integrand in the latter expression is non-negative and almost surely ai(xi)a 2 (xi)...o: m (xi) ^ 0, 
we have that 

c(xi, GjXi) = <^(xi) + il>(GiXi) /Ji — a.e. Vi G {1,m}. 

Consequently we obtain, 

^xiCfxijGjXi) = £>¥>i(xi) hi- a.e. Vi G {1, m}. (10) 


Note also that for i ^ j the set {xi G Xp, Gfix 1 ) = Gj(x 1 )} is a null set with respect to the measure Hi - This 
together with mi and the m-twist condition on S imply that the cardinality of the set {G 1 X 1 , ...,G m Xi} is 
m for p,i-a.e. Xi G X\. 

Now assume that 7 is also an optimal plan of ( MK ). It follows from Theorem 13.II that there exist a sequence 
of non-negative functions and, Borel measurable maps Ti, ...,T m : X\ —> X 2 x ... x X n such that 

m 

7 = A(Id x Ti)#Hi- 

i =1 

By a similar argument as above one obtains 

Pi{x)[D Xl c(xi,Tix) - Dip 1 (xi)\ = 0 /ii - a.e. Vi G {1,m}. 

For each i define fli = {xi G X±; /3j(x 1 ) ^ 0}. Since the cardinality of the set {G 1 X 1 ,..., G m xi} is m for ^i-a.e. 
Xi G Xi and since c satisfies the m-twist condition on S we have that for each i, {TiX 1 } C {G 1 X 1 ,..., G m xi} 
for /zi-a.e. Xi G 11,;. This completes the proof. □ 


4.1 Permutation symmetric problems 

Finally, we turn our attention to permutation symmetric problems; that is, problems for which the cost 
is symmetric under permutation of its arguments, and the marginal are all the same. As discussed in the 
introduction, these problems have important applications in physics and economics. 

We first demonstrate with an example that it is in fact possible to have unique solutions to these problems 
concentrated on several graphs. In [25] , it was noted that a problem of this type could not admit a unique 
solution concentrated on a single graph, {(x,F 2 (x),...,F n (x)}, unless each component of that graph was 
the identity almost everywhere, Fi(x) = x for a.e. x, for each i = 2,...,m. The example below illustrates 
that one can have somewhat less trivial unique solutions if we relax the structural requirements to being 
concentrated on two graphs instead of one. However, as Proposition 14.51 below implies, unique optimizers 
are still fairly special; this proposition asserts that one can not have unique solutions, except under very 
particular conditions. 

Example 4.4 (Another unique solution concentrated on two graphs) 

Take c(x,y,z) = xyz on [—1,1], with each marginal equal to 

T ~ g [£[-1,0] + 2 £[o,i]] 

Define Gi : [—1,1] — > [—1, l] 2 by Gi(x) = (— x, |x|) and G 2 '■ [—1,1] —t [—1, l] 2 by G 2 (x) = (x, — |x|). Note 
that by the geometric-arithmetic mean inequality, we have 
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ra- M 3 Izl 


+ — + — 


with equality only when |ai| = |y| = |^| and so 


c{x,y,z) 


xyz > — \xyz\ > 



M! 

3 



with equality only when either one or three of x,y and z are non-positive, and |cc| = \y\ = \z\; that is, we 
have equality precisely on the graphs of G i and G 2 . Therefore, if we can find a 7 £ H(y, y, y) concentrated 
on these two graphs, it is optimal; if there is a unique such 7, then the optimal measure is unique (as any 
probability measure concentrated outside these sets has larger total cost). 

Now, finding a measure 7 £ II(/z, y, y) of the form 


7 = ai (x)(Id x G\)#y + a 2 (x)(Id x G 2 )#y (11) 

amounts to solving, for almost every x > 0, the system of equations: 


ai(x) + a 2 (x) = 1 

ai(—x)+a 2 (—x) = 1 

12 2 
-ai(-x) + —a 2 (x) = | 

2 . . 1 . . 1 

-ai(x) + - a 2 {-x) = - 

2 1 2 
-ai(j;) +-ai(-x) = - 

2 . . i . . 1 

-a 2 (®) + ^a 2 {-x) = 

A straightforward calculation shows that the unique solution to these equations is given by 

ai(x) = | | 

and 

a 2 (x) = | 0 

The measure given by equation (ED. with these coefficients, is therefore the unique solution. 

The preceding example demonstrates that it is possible to have a unique solution to a symmetric problem 
which concentrates on two graphs. The components of these graphs need not be the identity, but note that 
for each Gi = ( Hi,Ki) and each x, one of the following holds: 

1. Hi(x) = x, 

2. Ki(x) = x, or, 

3. Hi(x) = Ki{x). 

In particular, y(5i x S 2 x S 3 ) = 0 whenever the sets Si, S2, S3 C [—1,1] are pairwise disjoint. This is 
essentially the only way that multiple maps can support a unique solution to a symmetric problem, as the 
following result shows. 

Proposition 4.5 Let n = 3. Suppose that the spaces Xi := X and marginals yi := y are the same for each 
i, and that c(x, y, z) is symmetric with respect to any permutation of the arguments. Assume that there exist 
mutually disjoint sets Si,S 2 ,S 3 C X, and an optimal measure 7 that charges S := S\ x S 2 x S 3 ; that is, 
7(S) > 0. Then the solution is nonunique. 


x > 0 
x < 0, 

x > 0 
x < 0. 
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As should be clear from the proof, a similar result holds for n > 4. 

Proof. The proof is by contradiction; assume that 7 is the unique solution. Letting a be any permutation 
on the variables (ir, y, z), this uniqueness implies that <r #7 = 7 . 

Now, note that 


< 7 #(,'ySixS 2 xS 3 ) = 7er(SixS 2 xS 3 )> 

where '}SixS 2 xS 3 denotes the restriction of 7 to the set 5i x S 2 x S 3 . Therefore, by the symmetry of c, we 
have 

/ c{x,y,z)d'y Sl xS 2 xS 3 = c{x,y,z)d(a # ('y Sl xS 2 xS 3 ))= c{x,y, z)d'y a{Sl xS 2 xS 3 )- (12) 

J X 3 J X 3 JX 3 

It is clear that each "f a (SixS 2 xS 3 ) fs h and that the measures 

~a ^2 TaOSiXSsxSa) 


&&S 3 


and 


1 


[ 7 S 1 XS 2 XS 3 + 7 S 3 XS 1 XS 2 +7S 2 xS 3 xSi] 


share the same marginals, where S 3 is the set of permutations on 3 letters. It then follows that 
7 = 7+^ X! 7<t(SixS 2 xS 3 ) - o[7SixS 2 xS 3 +7S 3 xSixS 2 +7S 2 xS 3 xSi] 


<t£S 3 


is a nonnegative measure with the same marginals as 7 . On the other hand, 7 7 ^ 7 , as 
X X] 7<t(SixS 2 xS 3 ) 7 ^ o[7SixS 2 xS 3 +7S 3 xSixS 2 +7S 2 xS 3 xSi]- 


3 


Finally, as by (fl2l) . 

/ c(x,y,z)d'y= c(x,y,z)d ; y, 
Jx 3 Jx 3 

7 is also optimal, which contradicts uniqueness. 


□ 


4.2 Extremal measures with fixed marginals 

We shall now prove Theorem Id.11 the proof requires a few preliminary results. We begin with the following 
result from [25l . 

Theorem 4.6 Let X and Y be Polish spaces equipped with Borel probability measures p on X and v on Y, 
and let {TjjfLi be a (possibly infinite) sequence of measurable maps from X to Y. Assume that the following 
assertions hold: 

(i) For each i >2 the map Ti is injective on the set 

Di := {a; 6 Dom(Ti) 0 Dom{Ti ); T\X 7^ TiX }, 

and Ran(Ti) 0 Ran(Tj) = 0 for all i, j > 2 with i 7^ j. 

(ii) There exists a bounded measurable function 9 : Y —> R with the property that 9{T\X ) > 9(Tix) on Di. 

Then there exists at most one 7 £ II(/x, u) that is concentrated on ujL 1 Grap/i(Tj). Moreover, any 7 £ II(/r, v) 
that is concentrated on ujL 1 Grap/i(Ti) is an extremal point ofYY(p,v). 

Our proof exploits this result, together with a connection between extremality in the two and three 
marginal cases, established in the following lemma. 
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Lemma 4.7 Let 7 be a measure in II(/u.i, /X 2 , 4 * 3 ) and let v be the projection of 7 onto X 2 x A 3 . If 7 zs an 
extremal point of n(/xi, v) and 12 is an extremal point of n(/u. 2 , /^ 3 ) thenj is an extremal point ofH(pi,fi 2 ,P 3 )- 

Proof. Let 71,72 £ n(^i,/z 2 ,M3) and 0 < t < 1 be such that 7 = f7i + (1 — t) 72. For i = 1,2 denote 
by Z7 the projection of 7j on A 2 x A 3 . It is obvious that zt.,^ £ n(/Z2,M3)> and that z/ = tv\ + (1 — t)v 2 - 
Since v is an extremal point of H(p 2 , ^3) and 0 < t < 1 we must have v = u\ = v 2l which meands that 
71,72 £ IlOi, z/). On the other hand 7 is an extremal point of II(mi, z/); together with 7 = <71 + (1 — 4)72, 
this implies 7 = 71 =72, from which we obtain the desired conclusion. □ 

Let (X, T,i, fi) and (Y, E 2 ,zz) be two measure spaces. A map T : X —> Y is said to be a push forward 
from p, to z/, dented by X^/z = z/, if 

V/eL^zz), f f(Tx) dp = f f{y)dv , 

lx JY 

where L 1 (zz) is the set of z/—integrable functions on Y. 

Proof of Theorem 14.11 Let 7 £ n(pi, ^ 2 , M 3 ) be a triply stochastic measure that is concentrated on 
U !? =1 Graph(Gi). By definition there exist non-negative Borel measurable real functions on : X\ —> M with 
Eti a i( x 1 ) = 1 such that 7 = Ez=i a i(Id x G,)#m l) i- e -i 

/ = / on(xi)f (x\,HiXi,KiXi) dp\. 

J j=1 dX* 

Denote by z' the projection of 7 on A " 2 x A 3 . We proceed with the proof in several steps. 


54ep L In this step we show that the projection v £ II(/Z 2 ,M 3 ) °f 7 onto A 2 x A 3 is concentrated on 
yj^ = 1 Graph{Ki oHf 1 ). For 1 < i < k, define the measures Mi, 1 and M 2 ,i by dp,i y 1 = ctidpi and M 2 ,i = i,i- 
For every measurable set B £ £>(A 2 ) we have 


k 

M2 (B)=7(A 1 xBxA 3 ) = ^ 

2—1 




/» 

F / XB(HiXi) d/jLi ti 

*= 1 

Al A 

/ Xb(x 2 ) dix 2 ,i 

i =1 

k n k 

dp,2,i=^2p2,i( B ) 

i=1 j=i 


Thus, for all £? £ B(X 2 ) we have y 2 (B) = E!"=i M2,z(-B)- It implies that each ^ 2 ,* is absolutely continuous 
with respect to /i 2 . Therefore, for each z there exists a nonnegative real function fit : A 2 —> K such that 
dy 2l i = /3idp 2 . This together with p 2 (B) = Ei=i M 2 ,i(-B) yield that 


/e ^ 

M 2 (5) = y2 Pi(x 2 )dfe 2 , 
i=i 


VS £ B(A 2 ), 
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from which we obtain JT =1 Pi( x 2 ) = 1 for P 2 almost every x 2 £ X 2 . Take a measurable bounded function 
/ : X 2 x X 3 —> R. It follows that 

/c 

/ f( x 2 , x 3 ) dv = / f(x 2 ,x 3 )dj= = X] / ai(xi)f(HiXi,KiXi)dni 

JX o xXa Jx " r ' r 


/X 1 XX 2 XX 3 


i=l 
k 


= y2 f{HiXi,KiXi)dm t i 
i= 1 

/c ^ 

= V' / f{x 2 ,KiO H~ 1 X2)dn2,i 

i= 1 Jx 1 

= / Pi( X 2 )f(X2,Ki oH~ 1 X 2 )d[l2- 

A _ 1 J .Xo 


This implies that ^ = JT =1 /3j(/d x KiO H i 1 )#P 2 - 


Step 2. In this step we show that ^ is an extremal point of n(^t 2 ,/x 3 ). We shall make use of Theorem 14.61 
to prove this. By assumption we have that Hi is injective and Ran(Ki) n Ran(Kj) = 0 for all i,j> 2 with 
i ^ j. This implies that if,; o H” 1 is injective for all i > 2 and Ran(Ki o If” 1 ) fl Ran(Kj o Hr 1 ) = 0 from 
which assumption (i) of Theorem 14.61 follows. The second assumption in Theorem 14.61 also follows due to 
hypothesis (Hi) in Theorem 14.11 and therefore the extremality of v in 11 (^ 2 , 4 * 3 ) follows. 

Step 3. We show that 7 is an extremal point of II(/zi, v). We shall again use Theorem 14.61 to prove this 
part. Note that Gj = (Hi, Ki) is injective for i > 2 as both Hi and Ki are injective. For i,j> 2 with i /j 
we have Ran(Gi) PI Ran(Gj ) = 0 as by assumption Ran(Ki) fl Ran(Kj) = 0. Define ip : X 2 x X 3 —> R by 


p(x 2 ,x 3 ) = 


_ J (k- l)9(x 3 ) - J2i =2 6 ( K i 0 H i x 2 ), ^3 £ -X 3 \ Uf =2 Han(Hi), 


d(x 3 ) - 9(Kj o H, x 2 ), 


X 3 £ Ran(Kj) for some j > 2. 


Since H,; is injective for i > 2 it follows that Ran(Ki) is Borel measurable ([3], Theorem 6.8.6). Thus, 
is a bounded Borel measurable function. We show that p(G\(xi)) > <p(Gj(x 1)) for all j > 2 and 
Xi £ Hom(Gi) fl Dom(Gj). Fix x\ £ Dom(G\) fl Dom(Gj). If HiXi £ X 3 \ G\ =2 R(mg(Ki) it follows that 

k 

ip(Gi(x\)) = ¥j(HiXi,HiXi) = (k - l)fl(HiXi) - ^ 0(Kj o H” 1 o HjXi) 

2—2 

k 

> (k — l)6(KiX\) — ^ #(Hi o Hf 1 o H1X1) 

2—2 

= (fc - l)(9(HiXi) - (fc - l)0(HiXi) = 0, 

where in the second line we have used the fact that 9 o KiO HJ 1 < 0 o Hi o Hj -1 . If now H1X1 £ Ran(Kj) 
for some j > 2 we obtain 

<p(Gi(xi)) = p(HiX\ 1 H1X1 ) = 0(HlXi) — 9(Kj o HJ 1 o H1X1) 

> 0(KiXi) — 0(Ki o Hj 1 o H\X\) 

= 9(Kix\) — Q(Kix{) = 0. 

It then follows that in both cases we have <p(Gi(x\)) > 0. We now sow that <p(Gj(x 1)) = 0. In fact, 

<p(Gj(x 1)) = ip(HjXi, KjXi) = 9(KjX 1) — 9(Kj o HJ 1 o HjxJ) = 0. 

Therefore for each 27 £ Dom(G\) fl Dom(Gj ) we have 

<^(Gi(xi)) > 0 = ^(Gj(xi)). 

It now follows from Theorem 14.61 that 7 is an extremal point of n(/ri, v). Therefore, it follows from Lemma 
14.71 together with steps 2 and 3 that 7 is an extremal point of n(/ii,/z 2 ,/X3). □ 
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